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We evoke situations where large fluctuations in the entropy are in- 
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q\ , duced, our main example being a spacetime containing a potential black 

hole whose formation depends on the outcome of a quantum mechanical 
event. We argue that the teleological character of the event horizon im- 
plies that the consequent entropy fluctuations must be taken seriously in 
any interpretation of the quantal formalism. We then indicate how the 
entropy can be well defined despite the teleological character of the hori- 
zon, and we argue that this is possible only in the context of a spacetime 
or "histories" formulation of quantum gravity, as opposed to a canonical 
one, concluding that only a spacetime formulation has the potential to 
compute - - from first principles and in the general case - - the entropy 
of a black hole. ceFrom the entropy fluctuations in a related example, we 
also derive a condition governing the form taken by the entropy, when it 
is expressed as a function of the quantal density-operator. 



I. Introduction 

For a system in equilibrium, its thermodynamic entropy is by definition a constant, 
but its statistical mechanical "entropy a la Boltzmann" , being a measure of the number of 
microstates making up the given "macro" or "meso-" state, can, and in fact does fluctuate. 
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Since, in equilibrium, the probability of a fluctuation associated with the entropy change 
AS varies as e _AS , large fluctuations of the entropy are very unlikely. Nevertheless the 
existence, even of small fluctuations is important in principle, and in certain situations 
they can give rise to readily observable effects like critical opalescence. 

Perhaps the most far reaching implication of entropy fluctuations on the theoretical 
side is that the law of entropy increase can survive them as an exact relationship at best 
in an average sense. Given this, it seems important that at least some averaged version of 
the entropy obey an exact law of increase, because otherwise perpetual motion machines 
would no longer be excludable on the basis of statistical mechanics. But whether or not 
the average entropy can decrease will depend on the details of how the average is taken, 
and in this way we can obtain some guidance as to which definition of the entropy is most 
appropriate. For reasons such as these, a situation where large entropy fluctuations occur 
with high probability would seem to be of particular interest. In this paper, we will evoke 
such a situation. 

We will in fact describe two such situations, one involving a "Schrodinger cat" like 
apparatus, the other involving a black hole and relying crucially on the teleological char- 
acter of its horizon — the fact that it responds in a certain sense to events that are still in 
the future. Although the validity of the first example can depend on how one interprets 
quantum mechanics, we believe that that of the black hole example cannot. 

The black hole example is important for another reason, as we will try to convince 
the reader. It illustrates the point that not every general relativistic body lends itself to 
description in terms of data on a spacelike hypersurface [R::forks]. Rather, the teleological 
nature of the horizon seems to demand description within a spacetime or "histories" frame- 
work, and indeed in a framework which extends those usually considered by incorporating 
histories that proceed into the distant future and then return to the present. 

The detailed working out of these points will be found in the remainder of the paper. 
Section II poses the problem of how to define the entropy in the face of its fluctuations; 
Section III analyzes the "Schrodinger cat" example and shows how it can guide us in 
selecting a suitable formula for the statistical mechanical entropy; Section IV introduces 
our second example involving a quantum mechanical event whose outcome will determine 
the formation or not of a black hole, implying a macroscopically large fluctuation in the 



entropy associated with hypersurfaces at slightly earlier times; Section V schematically 
evaluates a certain expectation value of the area of the black hole of this example within 
an extended path integral formalism; and Section VI attempts (without full success) to 
justify the identification of that expected area with the entropy of the black hole. In 
Section VII, we discuss our results and offer some conclusions. 

II. The uses of large entropy fluctuations 

It is not our purpose here to delve deeply into the meaning of entropy or to attempt to 
settle any of the persistent conceptual questions that attach to that concept. We merely 
want to recall some of the relevant definitions in order to fix our terminology and help 
establish a context for the "Gedankenexperimenten" we will consider. 

Within classical statistical mechanics one can distinguish two, somewhat different 
notions of entropy. The first, which we may call the "Boltzmann entropy", depends on 
the exact microstate of the system under consideration and is defined as the logarithm 
of the number of microstates belonging to the same equivalence class as the given one. 
The equivalence relation here is that of being "macroscopically indistinguishable" , and the 
equivalence classes may be called mesostates, using the terminology of [R::stoch-manifold] 
and [R::meso-def]. If Ni denotes the number of microstates making up the i th mesostate, * 
then the Boltzmann entropy is, 

S = log Ni (Er.SB) 

whenever the microstate finds itself within the i th mesostate. Although one can argue ' 
that the Boltzmann entropy tends strongly to increase, any possibility of formulating this 
tendency as an exact law is excluded by the existence of downward fluctuations which 
normally are small, but which also can be very large, albeit with what is usually taken to 
be exponentially small probability. 



* Within classical physics, the counting of states or Boltzmannian "complexions" can be 
done, at best, up to an arbitrary normalization, of course. 

t e.g. using the fact that the time spent in the i th mesostate is proportional to Ni 



In order to achieve an exact law of entropy increase despite the fluctuations, one can 
attempt to replace the exact equation of motion for the microstate by an approximate 
dynamics taking the form of a Markov process on the space of mesostates (see [R::stoch- 
manifold] for some references). To the extent that such an approximation can be justified 
one recovers an exact law of increase, not for the Boltzmann entropy, but for what one 
may call the "Gibbs entropy" , 

Y J P l logiV, + J2p* MVPi), (E :: Gibbs - S) 

% i 

where pi is the probability of realization of the ith state of the Markov process (the ith 
mesostate) . * Notice that the Gibbs entropy is an "ensemble functional" , rather than a 
function of the actual physical microstate (or even mesostate) of the system. Reinterpreted 
in terms of a probability density p on the space of microstates, it is (up to an additive 
constant) 

/<M*)log(l/pM), (E,SG -class) 

where the function p is chosen to be constant within each mesostate; i.e. p is that function 
which correctly reproduces the overall occupation probabilities pi of the mesostates and 
weights equally each microstate within a given mesostate. One sees from (E::Gibbs-S) 
that the Gibbs entropy can be interpreted as the average value of the Boltzmann entropy, 
Si = log Ni, augmented by a term capturing the uncertainty concerning which mesostate 
the system is actually in. (Conversely the Boltzmann entropy (E::SB) can be interpreted 
as the Gibbs entropy evaluated on an ensemble distributed evenly over the microstates of 
the i th mesostate.) 

Thus, we obtain the second law of thermodynamics in an exact form, but only at 
the double cost of (1) reinterpreting the increasing entropy as an ensemble function rather 
that a state function, and (2) working with an approximate dynamics rather than the 
exact one. The passage from an exact micro-system to an approximating meso-system has 
been called "coarse graining" , and some such procedure seems always to play a role in any 
derivation of the law of entropy increase, although the "meso-system" , need not always 



* In writing the first term of (E::Gibbs-S), we have assumed, in accordance with Liouville's 
theorem, that the particular probability distribution given by pi oc Ni is time invariant 
with respect to the Markov process. 



arise from so simple a transformation as grouping the microstates into equivalence classes. 
(Often it does arise this way, as when one defines an approximate stochastic dynamics for a 
Brownian particle by averaging over the degrees of freedom of the "reservoir" particles; here 
the position, and possibly the velocity, of the Brownian particle serve to parameterize the 
mesostate. In contrast, there are other schemes, such as that of the "BBGKY" hierarchy, 
that don't clearly incorporate any well-defined space of mesostates, or stochastic processes 
thereon, at all. Particularly interesting in this connection is the Boltzmann equation. On 
one hand, it might be viewed as a deterministic (and nonlinear!) dynamics on a certain 
space of mesostates, the space of 1-particle distribution functions. On this view, the 
function H of the so called "H -theorem" would be a species of Boltzmann entropy. On 
the other hand, the Boltzmann equation might alternatively be seen as first step in the 
"BBGKY hierarchy" , in which case it would be an equation of motion for a probability 
distribution on a multi-particle phase space, and the function H would appear as a type 
of Gibbs entropy.) 

For a quantum system the situation is similar, but with what would seem to be a 
greater degree of conceptual confusion overall. The actual counting of states is more 
satisfactory of course, but the distinction between the Boltzmann and the Gibbs entropies 
seems less clear cut, and depends partly on how one interprets the quantum formalism. 
There is disagreement, for example, over whether the classical contrast between microstate 
and ensemble does or does not correspond to the quantum distinction between state vector 
ip and density operator p. Also the meaning of the coarse graining leading to the concept 
of mesostate seems more obscure. For example, if one chooses to identify the state- vector 
ip with a microstate and some family of subspaces of the quantal Hilbert space with the 
space of mesostates, then one must deal with the fact that z/> will almost never be "in" any 
single mesostate. In the following , we will ignore such subtleties as much as possible, and 
content ourselves with the fact that fairly naive definitions of the Boltzmann and Gibbs 
entropies will suffice for most of the questions we deal with. (We also will not always 
specify Gibbs vs. Boltzmann entropy when the context does not require it.) 

The most obvious quantum analog of the integral expression (E::SG-class) is the fa- 
miliar operator expression, 

S« = tr(-plogp), (Ev.Sl) 



p being the density operator. In thermal equilibrium, as represented by the "canonical 
ensemble", p = Z(T)~ X exp(— H/T), (E::S1) yields an entropy for which the First Law of 
Thermodynamics is recovered as the exact relation 

d<H>= TdS {1) , 

where < H >= tr(pH). The definition (E::S1) therefore gives a consistent account of the 
entropy in equilibrium. The principal importance of entropy, however, stems not from its 
behavior in equilibrium but from the way that it governs the approach to equilibrium in 
accordance with the second law of thermodynamics. That is, its importance lies in its 
tendency to increase with time. 

Almost as familiar as the expression (E::S1), however, is the fact that its increase 
is incompatible with unitary evolution. Much as in the classical setting recalled above, 
the usual response to this difficulty is to attempt to modify either the expression for S 
or the law of evolution of p, or both. One introduces some type of "coarse graining" 
into the description of the system, leading to a new p acting in a new Hilbert space and 
to an approximate equation of motion for this coarse-grained p which is nonunitary. ; 
The one coarse graining known to us where the new equation of motion is not obviously 
approximate is that consisting in ignoring everything veiled by the surface of a black hole 
(event horizon). Such a coarse graining is also less "subjective" than most others. These 
features are part of what makes the black hole case so interesting. ] As long as the new 
evolution remains linear, there are general theorems guaranteeing the increase of S(p), 
under the condition that the "totally random" p oc 1 is a fixed point of the evolution 
[R::ref-on-increase] [R::Raf2]. Thus one recovers an exact, quantal second law for the 
coarse-grained entropy S^\ The fact that we obtain an exact law suggests that (E::S1) 
in this case should be interpreted as a Gibbs entropy (with p interpreted as describing an 
ensemble.) Recall however that (E::S1) will reduce to a Boltzmann entropy if we choose 
an appropriately "reduced" or " collapsed" operator p. 

Now the definition (E::S1) might look both familiar and natural, however, if all we 
demand of our definition of entropy is that its value increase with time, then the same 
theorems that guarantee the increase of (E::S1) work also for many alternative expressions, 
including all of the form 

S (2) =trF(p) (E::S2) 
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where Fix) is an arbitrary concave function [R::Raf2]. t Similarly the expression 

S& = -logtrp 2 (E::S3) 

has also been suggested as an alternative and can be shown to be nondecreasing under 
very general assumptions. This last alternative is particularly attractive since it is additive 
for uncorrelated subsystems and takes the value logiV for a macrostate composed of N 
equiprobable microstates. How, then, is one to select the correct entropy? 

In practice, the issue is completely irrelevant in most cases because, in some impre- 
cise sense, a single mesostate contains almost all the populated microstates, fluctuations 
away from that mesostate are negligible, and all entropy expressions (of both the Gibbs 
and Boltzmann variety) give equivalent answers. In principle, however, there can exist 
situations where arbitrarily large downward fluctuations in the (Boltzmann) entropy are 
highly probable, and these situations can provide guidance in the selection of the correct 
expression for the Gibbs entropy. 

The type of situation we have in mind involves very large fluctuations arising from 
the amplification of suitable quantal events. Consider, for example, a flammable substance 
inside a box, completely isolated from the outside world, with a quantum mechanical device 
that acts as a trigger to ignite the substance. The trigger might contain, for example, a 
half-silvered mirror and a device which will project a single photon at the mirror at some 
definite time t = to, in such a way that the photon will do nothing if reflected, but will 
cause the substance to be ignited if transmitted. 



' The following theorem is proved implicitly in [R::Raf2], although the statement of the 
theorem there is limited to the case of Fix) = — xlogx. Let 7i be a finite dimensional 
Hilbert space and let V be the space of positive hermitian operators (density matrices) on 
7i. Let T be a linear, trace-preserving map of V into itself which has the identity operator 
1 as a fixed point. Then, for any convex function F : [0, 1] — ► H and any (normalized) 
density matrix p, we have trF(p) < trF(T(p)). This implies the law of increase of the 
entropy (E::S2) (at least in finite dimensions) if the following conditions hold: (i) evolution 
pit) — > pit + At) is a linear, trace preserving map on the space of density matrices; (ii) 
The "maximally random state", p = l/(trl) is preserved by the time evolution; (iii) the 
evolution is Markovian in the sense that p(t + At) can be determined from p{t) at any 
single earlier time t. 



Let the entropy (E::S1) of the contents of the box before to be Sq, and let the resulting 
entropy corresponding to the case where the substance does not (respectively does) ignite 
be Si (respectively Su). Clearly So ~ Sj <C Su. Then, for an external observer, the 
entropy at time t c > to, but still before the box is opened, must, according to (E::S1), be 
taken to be S(t c ) = ^Sj + \Su + log 2, where we have assumed that the probability of 
reflection at the half-silvered mirror is \. 

Now, let's consider what happens when the box is opened at ti > t c . With probability 
1/2, the substance will have burned, whence we will associate with the box an entropy 
Su, and we will be in the ordinary situation where the (Boltzmann) entropy has increased: 
Sfo) — S(t c ) ~ ^(Sii — Si) ^> 1. However, also with probability 1/2, the substance will 
not have burned, whence we will have to assign the box an entropy Sfo) = Si. So this 
time, Sfo) —S(t c ) = f (Si —Su) <C — 1, and the entropy will have decreased substantially! 
This seemingly paradoxical behavior is reconciled with our expectations when we notice 
that on average the entropy doesn't change*: < Sfo) >= §5/ + \Su ~ S(t c ). In sum, 
the law of entropy increase holds on average, but there can be arbitrarily large fluctuations 
in individual cases. 

We are used to seemingly paradoxical situations arising when Schrodinger's cat puts 
in an appearance, and so one might worry that a better interpretation of the quantum 
formalism would vitiate the account given above. For example, one might think that the 
entropy at time t c is really zero because the system is still in a pure state, or, conversely, one 
might think that interactions with the environment actually "reduce the wave function" 
at time to, whence the entropy at t c would already be the same as it is at t<i. Or one 
might worry that the entropy that fluctuates is merely some subjective entropy "for us" , 
the external observer, rather than an entropy associated objectively with the system itself. 



* In saying that it doesn't change, we neglect the small decrease by log 2 associated with 
the fact that the twofold alternative of burned vs. not burned has been resolved by opening 
the box. Properly speaking we should, in order to keep to a consistently defined Gibbs 
entropy, continue to retain both the burned and unburned cases in the ensemble (with, of 
course, the observer incorporated into the system), in which case the missing log 2 would 
be restored. (We are dealing here with essentially the same reduction in entropy that 
"Maxwell's demon" produces, and any analysis of that situation in relation to the second 
law should apply here as well, cf. [R::OPen].) 



These worries disappear, we claim, and the situation therefore becomes more dramatic, 
when it involves black holes, thanks to the teleological nature of the event horizon. 

We refer here to the fact that the size and location of the event horizon, on a given 
hypersurface E is determined by events occurring to the future of S. Consider, for example, 
a situation where, at some moment of time t c corresponding to a hypersurface S c , a 
Schwarzschild black hole of mass M is present, perturbed by the presence outside the 
horizon of a static body of mass m, sustained, by a cable; and an individual is holding 
an ax with which he or she might decide at t — to > t c to cut the cable. The area of 
the event horizon on E c will be A\ (which can be well approximated by 16nG 2 M 2 ) if 
our ax wielder decides not to cut the cable, but it will be a larger area A2, in between 
A\ and 16irG 2 (M + m) 2 , if s/he does decide to cut it. The main point is that this value 
can not be ascertained without knowledge of events occurring to the future ofE c . The ax 
handler, whose decisions we might feel uncomfortable describing physically, can of course 
be replaced by a quantum mechanical t device, as in the previous example, which will 
induce a measurement-like event at t = to. 

Just as in the previous example, we can expect to obtain large fluctuations of either 
sign in the entropy. Indeed it is natural to assume that the entropy on E c is given by 

S(t c ) = q(2nA 1 / K ) + p(2nA 2 /K) (E :: SB HI) 

where k = 8nG and where p is the probability that the quantum mechanical "choice" to 
be made at t will result in the cable being cut, and q = 1 — p is the probability of it 
not being cut. Below, we will attempt to demonstrate that the entropy (E::S1) is in fact 
well-defined in this situation and that its value is correctly given by eq. (E::SBH1). The 
large fluctuations then follow as before. * 



t It is not obvious to us that the random choice must be made on the quantum level, but 
it does seem safer to use a quantum event, since, on current understanding, the resulting 
choice is absolutely unpredictable by anything existing on or to the past of E c . 

* Macroscopically large fluctuations can also occur in conjunction with phase transitions. 
There, however, the fluctuations connect states of equal (total) entropy. The thermal p in 
such a case is spread out over many mesostates, illustrating clearly the sense in which p 
may be said to describe an "ensemble" rather than an individual system, and in which, 
consequently, the entropy associated to such a p is a Gibbs rather than a Boltzmann 
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Finally, we will argue that this type of situation, resulting from the teleological char- 
acter of the event horizon, will be extremely difficult, if not impossible, to describe and 
analyze in terms of state vectors associated with hyper surfaces, as in canonical formalisms 
for quantum gravity, and that the only framework that seems capable of treating them is 
a path integral or "history" one. In addition, we remark here that the black hole example 
illustrates, once again, the untenability of a semiclassical version of gravity like a theory 
based on G a b = k < T a b >, since that would put the horizon in the wrong place, no matter 
what happens. 

III. Further analysis of the box example: Which entropy does Schrodinger's 
cat like best? 

Consider once again our box B containing an incendiary device E connected to a 
quantum mechanical trigger Q (say a partially silvered mirror with transmission probability 
p, or a spin-1/2 particle with its spin in the x direction which at t — to is going to be 
subjected to a measurement of it's spin along the 9 direction such that the result "+" will 
occur with probability p and the result "— " will occur with probability q = 1 — p) in such a 
way that with probability q the device will be ignited at time t = to, and with probability 
p it will not be ignited. 

To simplify matters we will suppose that initially the box's contents E + Q can be 
represented by a single quantum state-vector \b > and that, after the burning (if it occurs), 
the contents can be represented (in virtue of suitable coarse graining, possibly involving 
ultra-weak environmental influences) by a single mesostate comprising a large number 

of equally probable quantum state-vectors \a,i >, i = 1 N, representing the possible 

microstates of the resulting hot gas and ashes. 

Now let's suppose as in the original Schrodinger's cat Gedankenexperiment, that we 
do not open the box until a much later moment t — t<i. At any time t c G (to, £2) the state 
of the box's contents will be represented by the density matrix, 

p(tc) — p\b >< b\ + (q/N){\ai >< a\\ + \a,2 >< 02I + •••• + \cin >< un\} (E :: state — £1) 
entropy. 
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At any time t after the box is opened, t > t 2 , we will have two possible situations: With 
probability p, the device will have remained unignited so the contents of the box will be 
described by 

p(t 2 ) {1) = \b><b\ (E :: state - t2I) 

With probability q, the device will have ignited, so the contents of the box will be described 
by 

p{t 2 )^ H) = (l/iV){|ai >< ai| + \a 2 >< a 2 \ + .... + \a N >< a N \} (E :: state - t2II) 

Now let's see what has happened to the entropy in each case. To start with, the 
entropy at t = t c is 

S(t c ) = -p log(p) - q log(o) + q log(AT) w q log N, (E :: 5*1) 

while at t = t 2 we have, with probability p, 

S(t 2 )W =0 

and with probability q, 

S(t 2 )W = log(iV) 

Thus the entropy can either increase by plogiV or decrease by glogiV in the process of 
opening the box. On the other hand the average entropy is just 

< S(t 2 ) >= qlog(N) 

which differs from S(t c ) only by the quantity S' = —plog(p) — glog(g), corresponding to 
the fact that the alternative that is unresolved at t c is resolved at t 2 with a corresponding 
gain of information. 

Two aspects of the above example are worth pointing out. First, the fluctuation of 
the entropy is large: 

AS(t 2 ) = (< S(t 2 ) 2 >-< S(t 2 ) > 2 ) 1 / 2 = jp-q log(iV) 



i.e. is of the order of < S(t 2 ) > itself. And second, the requirement that the change in the 
average entropy after the box is opened arise only from the information gained about the 
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resolution of the initially unresolved alternative is enough to single out almost uniquely 
the expression (E::S1) for the entropy. 

To see this let's compute the entropy of the system at t c using, instead of (E::S1), the 
generic formula S^ of (E::S2). The result is 

S(t c )=F(p)+NF(q/N) 

At t — t 2 we have, using again (E::S2), that with probability p, 

S(t 2 ) = F(l) 

and with probability q, 

S(t 2 ) = NF(1/N) 

Thus the average entropy is just 

< S(t 2 ) >= pF(l) + qNF(l/N) 
Hence the change of entropy associated with the opening of the box is 

S < S >= pF(l) + qNF(l/N) - F(p) - NF(q/N) (E :: dTS) 

We expect that, since the only change occurring with the opening of the box is the res- 
olution of the alternative related to the action of the quantum trigger Q, the change in 
entropy can depend on p and q but not on the specific nature of the incendiary device, so 
5 < S > must be independent of N. Requiring this, we obtain 

98 QH* = V F (1/N) - F{q/N) - (q/N)[F'(l/N) - F'(q/N)] =0 (E :: eqF) 

where the prime denotes differentiation with respect to the function's argument. Putting 
x = 1/N, taking the derivative of (E::eqF) with respect to q, and setting q — 1, we obtain 

x 2 F"(x)-xF'(x)+F(x) = (E::eqF2) 

The general solution of this differential equation is F(x) = C\x log(x) + C 2 x where C\, C 2 
are arbitrary constants. Therefore the expression for the entropy becomes: 

S = tr(F(p)) = Ci tr(plog(p)) + C 2 (E :: NewS) 
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where we have used the fact that the density matrix is normalized to tr(p) = 1. It is 
also easy to see that the expression S^ of (E::S3) does not satisfy our condition of N- 
independence; it therefore is also ruled out as an alternative to (E::S1). Thus, the analysis 
of this Gedankenexperiment has led us, on the basis of very natural requirements to a 
unique expression (up to normalization and an additive constant) for the entropy of a 
system described by the density operator p. 

IV. The example involving gravitational collapse 

Consider a static spherically symmetric thin shell of mass M, in a similarly static, 
spherically symmetric asymptotically flat spacetime. The shell is fitted with a quantal 
device that at time t — (according to an internal clock) will make a random choice 
between triggering or not the collapse of the shell, which would result in the formation of 
a black hole. More concretely, imagine the shell as made of two thin massless concentric 
spherical reflecting walls separated by a small distance, with electromagnetic radiation 
confined between them. The triggering device may be imagined as before, except that this 
time, instead of igniting a fire, it makes the internal wall transparent to radiation when it 
is activated. Thus, with probability 1/2 (say), our static shell of radiation will become a 
null collapsing shell at t = 0, resulting in the subsequent formation of a black hole with 
mass M. 

One could worry about the feasibility of synchronizing the change in the transparency 
of the different parts of the internal spherical wall without having to delay the collapse for 
a time comparable to the light travel time across the shell after the quantum mechanical 
decision has been made. We do not believe this poses a problem. We can easily synchronize 
the change in, say, two opposite parts of the shell by using an EPRB device as our trigger 
mechanism: Take a spinless particle at the center of the shell which decays into two 
photons. Let's fit the internal wall with detectors that will measure the helicity of the 
photons and give each of them instructions to change the transparency if the photon it 
detects has positive helicity, but not if it has negative helicity. In this way, a coordinated 
collapse of the shell will start at opposite points in the shell, without the need to propagate 
signals across the shell after the quantum choice is made. One can readily extend this 
synchronization from a pair of antipodal points to the entire shell by means of correlated 
many-particle states. 
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The metric outside the thin shell is, of course, the Schwarzschild metric: 

ds 2 = -(1 - 2GM/r)dt 2 + (1 - 2GM/r)~ 1 dr 2 + r 2 dtt 2 (E :: metric!) 

for r > R s hell, an d the metric inside it is the Minkowski metric: 

ds 2 = -dT 2 + dR 2 + R 2 dQ 2 (E :: metricl) 

for R < R s heU- We approximate the shell as infinitely thin. The matching of the exterior 
coordinates (£, r) with the interior coordinates (T, R) can be deduced from the requirement 
that the metric induced on the shell from the exterior spacetime must coincide with that 
induced from the interior spacetime; while the trajectory of the shell (in the case where 
it does move) can be deduced from the requirement that it move at the speed of of light, 
or more formally, from the requirement that the induced metric thereon be degenerate 
[R::Isr]. 

Let the motion of the shell be given by specifying the functions r s heii = R^(t), in 
terms of the exterior coordinates, r, t, and R s heii — R^ 2 '(T), in terms of the interior 
coordinates, R, T. For the metric induced from the exterior spacetime, we have 

da 2 = -[(1 - 2M/r) - (1 - 2M/r)~ l (dr /dtf]dt 2 + r 2 rf0 2 , (E :: imetricl) 

while for the metric induced from the interior spacetime, we have 

da 2 = -[1 - (dR/dT) 2 ]dT 2 + R 2 dQ 2 , (E :: imetric2) 

(where we have taken 8irG = 8tt). Agreement of these expressions requires r $ h e ii = Rsheih 
or in other words, R^(t) = R( 2 *(T), together with 

[(1 - 2M/R) - (1 - 2M/R)- 1 (dR/dt) 2 ]dt 2 = [1 - (dR/dT) 2 ]dT 2 (E :: Ttl) 

These matching conditions, which let us relate the interior to the exterior coordinates, 
work out slightly differently in the timelike and null cases. 

Let's choose our coordinates so that the "moment of decision" is T = t = 0. Then for 
t, T < the shell is static and we have R = Rq, where Rq is the initial radius of the shell. 
In this case, we obtain from (E::Ttl) 

T = y/1 - 2M/R t (E :: Tt2) 
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If the shell fails to collapse, then (E::Tt2) remains true for all time. On the other 
hand, if the shell collapses as a null shell starting at t = 0, we can obtain (for T,t > 0), 
both T and t as functions of R from the condition that both sides of (E::Ttl) vanish, 
i.e. that the induced metric on the shell be degenerate. From the right hand side of this 
equation we obtain R(T) = Rq — T, and from its left hand side we find 

,R(t) -2M N 
BW - Bo + 2Mlog(^- i j ir ) = - t 

(where we have used the initial condition R(t = 0) = Rq). 

Obviously, the collapsing shell will cross the Schwarzschild radius at t = +oo, T = 
Rq — 2M. But, in fact, the horizon will be formed earlier than that. To determine when, 
consider a light signal starting at the center of the shell at T — T\ and traveling radially 
outwards. It will be able to escape to infinity iff it reaches the shell before the collapse 
has occurred. That is, it must reach the shell while one still has R s heii > 2M. The signal 
travels according to R = T—T±, whence it will meet the shell when T—T\ = R(T) = Rq—T, 
that is to say, at T = (1/2)(Rq + T\), at which time R s heii = (l/2)(-Ro ~ T\). So, the 
signal will escape iff T\ < Rq — AM. If we take, for example, the initial shell radius to be 
Rq = 3M, then the signal must leave the center with Ti < — M to be able to escape, and 
the origin at T > —M is already inside the horizon, if it turns out that the collapse is in 
fact triggered at T = 0. 

To summarize, the locus of the horizon at times earlier than T = depends on what 
happens at T = 0. If there is no collapse, there is of course no horizon. If the collapse 
occurs at T = then the locus of the horizon at earlier times is given by 

T - R = R - AM. 

Consider now a Cauchy hypersurface S tc with t c < defined by the condition t = t c 
outside the shell and by the corresponding condition T = T c = (1 — 2M/R ) 1 ^ 2 t c inside 
the shell. What is the area At of the intersection of the horizon with £+ ? If we choose 
t c > —(AM — Rq)(1 — 2M/Rq)~ 1 / 2 , and at t = the collapse is in fact triggered, we will 
have 

A tc = AnR 2 c = An(AM - R + (1 - 2M/R ) 1/2 t c ) 2 (E :: Atcl) 
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For example, if we choose Ro = 3M, we will have a nonvanishing area for t c > —\f?>M (if 
the collapse is triggered) and its value will be 

A tc = 4tt(M + £ c /v / 3) 2 , (E :: Atc2) 

so for t c = — ^M, say, we will have At c = nM 2 . Of course, if at t = the collapse is not 
triggered, we will have At c = 0. Note that by taking Ro sufficiently close to 2M we can 
have t a nonzero intersection of Et c with the horizon as early as desired in exterior time 
t c ; however, we will always have T c > — (AM — Rq) > —2M when we have such a nonzero 
intersection. In all these situations, the area A tc will be bounded by 16nM 2 , of course. 

We have considered as a natural choice, the foliation of the region of spacetime prior 
to t = 0, T = by hypersurfaces E ic that are orthogonal to the timelike Killing field 
present in this region ((-§i) a outside the shell, and (-§f) a inside). Other, equally natural 
foliations exhibit the same anticipatory behavior of the horizon. Consider, in particular, 
the foliation by hypersurfaces ££ which coincide with the previous ones outside the shell, 
but are continued inside the shell as unions of radially ingoing null geodesies. * Such a ££ 
will be defined in the outside by the condition t — t c , and in the inside by the condition 
T + R = T c + R = (1 - 2M/R ) 1 / 2 t c + R . If the collapse is triggered at t = 0, then the 
intersection of EJ with the horizon will have a radius 

R* c = (1/2)((1 - 2M/R ) 1/2 t c + AM) , (E :: Re) 

as long as t c > -4M (1 - 2M/ J R )" 1/2 , and an area A* tc = w[(l - 2M/R Q ) 1 / 2 t c + AM] 2 . 

In concluding this section we would like to emphasize the apparently objective char- 
acter of the ambiguity in the magnitude of the horizon area in the above example. It 



' Assuming that it is possible, in principle, to build a shell arbitrarily close to the Schwarz- 
schild radius. 

* An ingoing null surface is particularly apropos in connection with the scheme proposed 
in [R::Raf2] and [R::chandra-adelaide] for actually proving, from first principles, the non- 
decreasing character of an entropy like that discussed in section VI below. We presume 
that, in fact, the quantum gravitational second law will require the entropy to increase 
along an arbitrary well-defined foliation, just as the classical second law does. 
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is not that "we avoid finding out" whether a horizon exists (as with the Schrodinger cat 
example), but that it is objectively impossible for anyone to find out, given access only to 
information available on the given hypersurface. 

V. Path integral evaluation of the expected horizon area 

In this section we sketch a path integral calculation that would justify the expression 
we used earlier for the expectation value of the area of the horizon on Et c : 

< A > = 1/2 x + 1/2 x A tc 

where A tc is the area of the horizon's intersection with E ic in the case that the collapse is 
triggered at t — . 

Before we begin the calculation, we review the path integral approach to ordinary 
non-relativistic quantum mechanics. The basic ingredients are paths (or "histories") 7(t), 
i.e., curves 7 : / — ► T, where / is a time interval, and T is the configuration space of 
the system in question. The outcome of the formalism is the assignment of generalized 
probabilities P (positive real numbers) to certain classes C of paths. In special situations 
these numbers P can safely be interpreted directly as probabilities, but in general they 
cannot, and recourse to a more subtle interpretive scheme is necessary. ' To avoid confusion 
with other uses of the term "probability" , we will refer to the number P = n(C) associated 
to a particular class C of paths as the quantal measure of C. 

By definition, a class C of paths is specified by the imposition of certain restrictions 
on 7. For example, if the system has been pre-selected to be at a certain q$ e T, at time 
t — to, and one is interested in where the system will be at time t — ti, given that in the 
intervening time period the paths accessible to the system are those in the subset C, then 
the classes of interest may be denoted i C qi ' , by which we mean the class of paths that 
begin at qo at t = to, belong to C, and terminate at qi at t = t±. The quantal measure of 
C qi is then (formally) 

MCgJ = p (Qi, h, qo, h; C) = iV|£ 76C)7(to)=g0)7(tl)=gi e iS[7]t o | 2 (E :: pi) 



' Such interpretive schemes are described, in more or less detail, in [R::jim-et-al] [R::drexel] 
[R::isham]. 
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where S^]/ = L 1 L[y(t f )]dt f . Here, we have introduced an optional normalization con- 
stant N which can be chosen to make the measures add up to unity if desired: 

P(q 1 ,t 1 ,q ,t ;C)dq 1 = l (E :: p2) 

qier 

The intervals of definition of the curves in the class C must, of course, include any time 
period used in the specification of C. In the gravitational case, we will see that this can 
make it necessary to consider paths that go into the future and come back to the past. 

Now take another example in which all the paths under consideration obey both 
7(^0) — <?o and 7(^1) = Qi , while our interest is in the value of q at a pair of intermediate 
times t and t[ between between to and t\ (to < t' Q < t' x < t\). In such a case, the class 
C might be specified by a pair of characteristic functions Co and C\ of q at t = t and 
t — t\, respectively, such that 7 G C iff both Cq and C\ take the value 1. Then eq. (E::pl) 
becomes (we omit the optional coefficient N) 



» ~ • • ... lS W!,i , ,,„ iS[f]*} 



P =|E 7(toHg0)7(ilHgi e^W^o( 7 (t ))e m ^ 1 ( 7 (t' 1 ))e ° l7J *i | 2 

iv v v iS[-r]t°r> t >\ iSll] t' n ( /n iSr W«/ .2 

= |^^gi^7(to)=«o,7(*i)=9i,7(*i)=9j,7(ti)=«ri e ° G o(g ) e ^lUl) e i| 



= |E g /E g /E 7( ^ )= g/ j7(ti)=g /*J(g 1 )e '°*o(g )l 
where we have defined the "wave functions" ^o(<Zo) = ^7(to)=5o,70')=9' e * °Co(q' ) and 

• or 1*1 

^1(91) = ^7(*i)=9i,7(t' ')=q' e% Ci(?i) at t = t and t = t^ respectively. In this way, all 
the conditions on the path referring to times t < t' or t > t\ are condensed into the wave 
functions \l/o and ^\. Such wave functions will be definable whenever there exist times 
t' < t'i such that the full set of conditions on 7 is the conjunction of conditions referring 
solely to t < t , t G [t , t' x ] and t > t[, respectively. 

Both of these examples are rather special. A more typical situation is that in which 
we have an "initial" wave function \1/ at t = to (providing information on the behavior of 
7 for times prior to to) and our interest is in classes C whose definition refers to times 
t G (to, ti), with no final condition on 7(^1) (and therefore no relevant final wave function). 
In such a case, we have 

fi(C) = E (?l6r |E go E 76C . j7 ( to ) = g 0)7 ( tl ) =gi e l 7 *o^(g )| 2 = E gi6 r|E 76 c, 7 (t 1 )=g 1 »4.[7]tJ| 2 

(E::p7) 
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where we have defined the amplitude .4(7] t J = e l [7Jt o\I/(7(to)). 

Using eq. (E::p7), one can define an "expectation value" for any physical quantity 
represented by a path functional -^(7). (For example, for a system consisting of a single 
particle moving in one dimension, F might be the position operator at some time t a , -^(7) = 
x (l(ta)), or the velocity operator at some other time t&, -^(7) = ^x(^(i))\t=t b )- Since 
a quantum measure does not obey the probability sum rules, the ordinary probabilistic 
concept of expectation value does not automatically carry over, but a convenient definition 
for present purposes is the following. Consider the range of values that F can take within 
C, which for notational simplicity we take to be the discrete set (/1, f^, ^3, • • •)? an d define 
the class C; by 

C z = {7 e C\F{i) = /J (E :: d) 

Obviously, C = UjCj. Then we can define 

< f > = e fi mq) / E ^°i) ( E :: ave ^ 

i i 

Another natural definition might be 

< F > = 3* I J2 AirAl\F(l) I / E A[l\*Al) , (^ " ™e - aft) 

where the sums are over all 7,7 G C such that 7(^1) = 7(^1), and '9ft' denotes "real part 
of". In general, this differs from (E::ave), but using it instead of (E::ave) in the present 
context would not alter our main result (E::ea2) below. 

It is often convenient [R::Raf] to consider instead of the path 7, a new path 7' = 7 -1 , 
obtained from 7 by running it from the future, t\, to the past, to- We must then remem- 
ber that, if a forward running path is assigned the amplitude A, then the corresponding 
backward running path is assigned the amplitude A*, a rule that will be important to us 
later. We then can write, under the conditions that led to (E::p7), 

<F>= E F{l)Al]Al'\ I E Al\Al'\ (E::F4) 

7,7'eC ' 7,7'eC 

F( 7 )=F( 7 ') F( 7 )=F( 7 ') 

where the sums are over all 7,7' G C with the additional requirements that (z) 7(ti) = 
7'(ti) and (u) -^(7) = F(p/'). Here the path 7 is forward running from to to ti, and the 
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path 7' is backward running from t\ to to- (For a backward running path 7', we interpret 
"7' G C" and "^(7')" with respect to the corresponding forward running path, (7') _1 .) 

Consistency demands that all these expressions be independent of ti, as long as t\ is 
taken to be late enough to ensure that the paths are defined at all times that are relevant 
for the imposition of the conditions C and Ci, and this independence follows, in ordinary 
quantum mechanics, directly from unitarity. [R::logan] 

Now let's adapt this formalism to general relativity. The analog of P(qi, ti,qo,to) C) 
in equation (E::pl) is 



P(hi,(f)i,ho,(f)o;C) 



£ 

{g,4>)ec 



e iS[g,<j>]*l 



(E :: ps) 



where h = h a b and <p are respectively a Lorentzian metric and a collection of non-gravita- 
tional matter fields on a spacetime manifold M with initial boundary £0 — 9qM and final 
boundary Si = d±M, and S is the action-integral for the gravitational and matter field 
history including the surface terms required to make it additive. The sum* (formal, as 
always) is restricted to pairs 7 = (g, (p) that induce the metric hi and the matter fields 
<pi on Sj (i = 0, 1) and that satisfy the additional conditions defining the class C of 
histories [R::Hawk2]. In a cosmological setting, a further condition must -- plausibly - 
be imposed on (g, cp), namely a constraint on the total spacetime volume f M ^J—gd A x of 
M. This "unimodular" condition is suggested by analogy with nonrelativistic quantum 
mechanics [R::logan] and appears to improve the convergence of the cosmological path 
integral while yielding more physically acceptable results [R::jorma-unimod]. Even if it is 
adopted, however, it probably has no influence on the asymptotically flat path integral we 
are concerned with here. 



* In theory, the sum extends over all diffeomorphism classes of manifolds M as well as over 
all inequivalent ways of attaching M to £0 an d £i- It seems to us unlikely, however, that 
any such functional integral cum topological sum could be well defined, and we expect 
this formal expression will give way, in the correct theory, to a genuine finite sum over 
causal sets [R::causet] or other discrete structures. In this connection, we note also that, 
in omitting the sum over topologies, we have omitted as well the complex weight-factor 
that one can include with each distinct topological class. Each consistent choice of weights 
yields a different quantum "sector" , with the choice of sector determining, for example, 
whether a certain topological geon will be a boson or a fermion [R::fay-rds]. 
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For the expectation value of the history functional -^(7), we can use the same ex- 
pressions, (E::ave) and (E::F4), as above, remembering always that the histories must be 
extended sufficiently (run up to a late enough time) so that the value of the functional F can 
be evaluated for each one of them. Recall also that the sum in (E::F4) is restricted to pairs 
of histories that induce the same boundary data on £1 and are such that ^(7) = F('j'). If 
the initial condition on £0 is specified by a wave function ^o(h a b, 4>)i then the amplitude 
that enters into (E::F4) is, for a forward running path 7 = (g, <p), 

A(<y) = e S[9 ^l^ (hab, (/>) (E:: amp) 

where \l/o is evaluated at the configuration (h, 4>) induced on £0 by the history 7 = (g, <f>). 

Now let's apply the formalism to the specific case of interest: the evaluation of the 
expected area of the intersection of the horizon with the hypersurface T, tc of the previ- 
ous section. We take the initial ^o, defined on some initial Cauchy surface £0, to be a 
quasiclassical wave packet corresponding to the classical data describing the static start- 
ing configuration of metric and matter shell described in the preceding section (so ^0 is 
centered on the metric hP ah and matter fields 4>° induced on £0, by the pre-collapse clas- 
sical metric and fields, and is approximately constant in the neighborhood of (h°,<p°), 
corresponding to the static nature of the classical data) . In addition, of course, \l/o must 
also describe the appropriate, non-stationary state of the quantum mechanical triggering 
device. 

The class C of histories we must consider is that of asymptotically flat spacetimes (in 
order for the horizon concept to be meaningful) , and they should, in principle, extend into 
the future for an infinite time (relative to infinity) for the same reason. Once again the 
teleological nature of the horizon manifests itself, forcing us to consider paths that reach 
into the distant future, in order to compute the expectation value of a functional that 
could naively be thought to depend only on information associated with the hypersurface 
£t c . The functional of interest to us is ^(7) = A tc , the area of the intersection with 
the hypersurface £t c of the horizon associated with the spacetime 7. Thus, in principle, 
everything is set up and all that remains is to use equation (E::F4) to evaluate < A tc >. 

The key observation now is that the path integral will be dominated by classical 
histories (i.e., ones that satisfy the classical equation of motion), given that we start from 
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an almost classical initial state; except that, as a consequence of the binary choice made 
by the quantum mechanical trigger, there will be two classical histories (rather than only 
one) that, together with small fluctuations around them, will contribute. Let's call these 
histories 7+ (the history with the black hole) and 7_ (the history without the black hole) . 
Then, we will have, to an excellent approximation, only four terms contributing to the 
sum (E::F4). (More precisely we will have four sets of terms, but no harm will result from 
absorbing the contributions of the fluctuations into the amplitudes .4.(7).) 

In fact the off diagonal terms will vanish and only two terms will remain. Indeed, 
they vanish twice over, so to speak: first because the two histories 7 + and 7_ induce 
macroscopically different data on the final hypersurface Ei, while in (E::F4), only pairs of 
histories inducing equal data contribute to the sums; and second because -F(7+) 7^ F(7_), 
while only pairs with equal F contribute to the same sums. Hence we find for the expected 
area, 

<F> - M~ I+ )A W)+ A M A«_) =2- 4 ' < E::e « 2 > 

where A denotes the area of the horizon on Et c in the case that the collapse does occur. 

It is also easy to find the magnitude of the fluctuations in the area, which we can 
measure by the standard deviation of F, 

AF = (<F 2 >-<F > 2 ) 1 / 2 = (-A 2 - -A 2 ) 1 ' 2 = -A (E :: fal) 

For a macroscopic black hole the fluctuations are therefore enormous. 

In writing the last two equations, we have used the formulas of ordinary probability 
theory. In other settings this could be questioned, but there is little doubt that it is ap- 
propriate here, because the only two (sets of) histories that matter differ macroscopically 
from each other. In this situation (of "decoherence of macroscopically distinct alterna- 
tives") the quantum measure effectively reduces to a classical probability measure, and 
one may safely utilize all of the concepts of classical probability theory. The vanishing of 
the off diagonal terms also has the pleasant consequence that the expression (E::ave-alt) 
that we introduced briefly as an alternative definition of expectation value would have 
yielded exactly the same results as the definition (E::ave) that we did use. 



22 



VI. Path integral evaluation of the entropy 

Heretofore, we have worked under the assumption, represented in equation (E::SBH1), 
that the entropy associated to the Cauchy surface E ic would be (up to small corrections) 
the expectation value of the horizon area on Et c . In Section V, we were able to evaluate 
this expectation value using an extension of the path integral formalism, and we found that 
it had the value one would expect, namely | x + ^ x A, A being the area of the horizon's 
intersection with Ef c in the case that the collapse occurs. To do this computation, we were 
forced to extend the spacetimes that entered into the sum over histories well beyond the 
surface on which we were trying to evaluate the area, in order to come to grips with the 
teleological definition of the event horizon. ' 

This computation of the expected area corroborates our earlier assumptions in part, 
but it still doesn't prove that the expected area can be identified with the entropy. In the 
present section we attempt to obtain this identification from first principles. That is, we 
seek an expression for entropy that works in general, but that can be shown to reduce to 
the average area in the situation at hand. In seeking such an expression, we will be led to 
a still more radical extension of the sum-over-histories formalism, namely the inclusion of 
histories that double back in time. 

We will be assuming that, in more ordinary situations where the location of the 
horizon is well defined (up to merely microscopic fluctuations), the expression — trplogp 
yields the standard formula S = 2ttA/k when we take p to be the density operator p ext 
for the gravitational degrees of freedom of the black hole exterior (cf. [R::chandra-symp] 
[R::entangle]), i.e. we assume that 

_ tr ( p exterior logp exterior) = 2n A/ K (E :: entangle) 

in such ordinary situations. Our aim here, of course, is not to justify this assumption 
per se, but only to show that, on its basis, we can generalize the area law to situations 
involving large scale fluctuations of the horizon. 



t A similar extension of the histories has been advocated as a way to extend the path 
integral formalism to spacetimes with closed causal curves [R::time-cycles]. 
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Turning now to the pre-collapse situation of Sections IV and V, let's assume the initial 
state (on some hypersurface E = Eo corresponding to a time before any horizon can have 
formed) to be described by a wave function ^>q(X) taking values on the space of 3- metrics 
and the other field variables on Eo, i.e. X = (h a b, (f>). The corresponding density matrix is 

p (X,Y) = *o(X)*o(Y)* 

At a subsequent hypersurface E corresponding to a later time t, the system is described 
by the wave function 

7 

where the sum is over histories beginning in Eo and ending in E and which coincide on E 
with X. The corresponding density matrix can be expressed as 

p(X,Y) = ]Te^M- s M)tfo(7(io))*o(7%)r 

7,7' 

= ^e'( 5 W- s MVo(7(*o),7'(io)) (E :: dmeval) 

7,7' 

Now in the case where we can separate the degrees of freedom of the system into two 
groups A and B associated, let us say, with two different regions of E, we can write the 
density matrix as 

p((X A ,X B ),(Y A ,Y B )) 

and then compute the reduced density matrix for the degrees of freedom A in the standard 
way: 

p(X A ,Y A ) = Y t P((X A ,X B ),(Y A ,X B )) 

X B 

In the present situation, we will want to identify the variables X A with the region of E 
exterior to any horizon that may be present and the X B with the black hole region interior 
to the horizon. The problem is that the location of the horizon cannot be ascertained from 
the field data X = (h, 4>) on E. Rather, we must, as in the previous section, locate the 
horizon with respect to a spacetime that extends sufficiently far to the future of E. Here, 
we will employ the same method as there, but with the added twist that, since we now 
need to define a density operator p and not just a path functional ^(7), we will need our 
paths to return to E after their excursion into the future. 
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Before continuing, let us pause to analyze a simpler, but analogous situation. Consider 
the superposition of a single photon state with a state in which a pion is present together 
with the photon: 

I* >= ^2c k \k > |0 > +^2<ik,p\k > \p >, (E r.pipho) 

k k,p 

where the first ket in each term corresponds to photons and the second to pions. The 
equivalent density matrix |\1/ >< \I/| is 

p = y~]cfc4/|fc > [0 >< 0| < k'\ + y^ dk,pd* k/p ,\k> \p><p'\ < k'\ 

k,k' k,k',p,p' 

+ ^ c kd-k', P '\ k > 1° >< p'\ < k'\+ ^ d k ,pC* k ,\k > \p >< 0| < k'\ . 

k,k',p' k,k',p 

If we want the reduced density matrix for the photon, we must trace over the pion degrees 
of freedom, obtaining 

Pphoton = y~]c fc Cfc,lfe >< k'\ + ^2 dk, P d* k , iP \k X k'\ . 

k,k' k,k',p 

V v ' N v ' 

0° O 1 

'photon 'photon 

Notice that Pp hoton , which arises from the zero-pion sector, is still a pure state, while 
Pphoton' w hich arises from the one-pion sector, is in general highly mixed. Notice also that 
the interference terms drop out upon tracing and do not contribute to the reduced density 
matrix. Notice finally that, if the state (E::pipho) corresponds, for example, to an energy 
eigenstate, then we will have Pphoton Pphoton = 0- ^ n these features will have analogs for 
us, with the pion playing the role of the black hole interior, and the photon that of the 
fields outside the black hole. 

Now let's proceed to evaluate the entropy of the state associated with E ic . First we 
evaluate the density matrix at t c using (E::dmeval) but* taking the class of paths that 
start at to, go to very late times, and come back to t = t c . As before, the result will be 



* The point of using such paths is not that it changes p(X, Y) as such, which it won't, 
assuming unitarity. Rather, their use lets us locate the horizon and thereby distinguish 
the black hole interior from its exterior preparatory to tracing over the degrees of freedom 
of the former. Our treatment here glosses over some unresolved issues to which we will 
return at the end of this section. 
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dominated by classical or nearly classical paths. The subclass C of such paths can, in our 
case, be divided into two further subclasses: the class C\ of histories with a black hole, and 
the class C2 for which the corresponding spacetime does not contain a black hole. Then 
we have 

Pt e (X,Y)*> E ^ W - 5M W7(*o),7'(*o)) 

7,7'eC 

E + E + E + E W w - 5[7 ' ] W7(*o),7%)) 

v 7,7'eCi 7,7'ec 2 7eCi,7'ec 2 -yeC2,i'eCiJ 

(E :: dmeval2) 

Now we claim that tracing over the degrees of freedom associated with the black hole 
interior yields (in analogy with our photon-pion example) a reduced density matrix which 
contains no interference terms. That is, the instruction to take the trace over the degrees 
of freedom residing within the black hole will force us, in the case of the last two sums 
in (E::dmeval2), to compute the inner product between two partial wave functions, one 
of which corresponds to a flat metric in a macroscopic, spherical region (the region inside 
the black hole in the case where the collapse occurs) and the other of which corresponds 
to the empty set (the region inside the black hole in the case where no collapse occurs). 
This inner product should certainly vanish, and therefore there will be no contributions 
from the last two sums in (E::dmeval2). Thus we get for the reduced density matrix for 
the black hole exterior: 

p e t f (X,Y) = p P £\x,Y) + qpg\x,Y) , (E :: rhol2) 

where the superscripts 1 and 2 indicate the contribution arising from the black hole and 
the no black hole sectors respectively and where p and q indicate the corresponding prob- 
abilities (which we took in Section V to be p = q = 1/2). What is equally important, we 
claim that p\ and p t are orthogonal: 

(1) (2) (2) (1) n (T? ,, x 

Pt c Pt c = Pt c Pt c = ( E:: orthog) 

The reason, as before, is that the two correspond to entirely different types of (exterior) 
geometries, the former contains a flat region with a big spherical hole, the latter contains 
the same flat region with the hole filled in. 
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Now let's evaluate the entropy (E::S1) associated with pf xt . To that end we express 
(E::S1) as a series in terms of the traces of powers of p. Letting a := 1 — p, we have in 
general 

n—l n—1 n=l k=0 v 7 

Hence * 

tr(-plogp) = £^ ( k J ^-^tr(p k+1 ) . (E :: series) 

n=lk=0 ^ ' 

But from (E::orthog) we have 

tr (p ext ) n = tr(ppW + qp (2) ) n = tr(ppW) n + tr(qp^) n 

whence substituting p\ xt into (E::series) yields for the entropy 

S tc =tr(-p ext logp ext ) 

= tr(-pp (1) logp,9 (1) ) + tr(-qp (2) logpp (2) ) 

= ptr(-p (1) logp (1) ) +qtr(-p (2) log/9 (2) ) +plog(p _1 ) +qlog(q _1 ) 

That is to say, we have found that the entropy on E tc is - - up to the correction term 
5" = plog(j> _1 ) + glog(q ,_1 ), which is the entropy associated with the unresolved alterna- 
tive of having or not having a black hole on E tc - - the appropriately weighted average 
of the entropies of the two cases, "horizon present" and "horizon absent". In view of 
(E::entangle), we thus obtain the desired result 

S t =px — A + qxO + S' , (Er.ans) 

K 

(where, in our specific example in Sections IV and V, we had p = q = 1/2). 

Before concluding this section, we wish to call attention to certain shortcomings of 
our definition of a reduced density matrix p ext , related to the type of zigzag path we were 
led to employ. The matrix p(X, Y) has two slots corresponding to the two paths 7 and 7' 
in (E::dmeval). If we take both 7 and 7' to go to the future and return to S ic then both 



* We can't amalgamate the sums by collecting all the multiples of trp n , for each n, into a 
single term, because the series does not converge absolutely. 
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X and Y can be endowed with information on the horizon location (they will become in 
effect not pairs X = (h,4>), but triples X = (h,(f),b), where b is a "marking" telling us 
where the black hole region sits within £). In this extended sense (which is the one we had 
in mind in writing most of this section), the p ext (X, Y) that results from tracing out the 
interior degrees of freedom will be hermitian and positive, but not necessarily normalized 
to unit trace. On the other hand, starting from the alternative definition (E::ave-alt) of 
< ^(7) >, we can form a slightly different p ext , of which only the second slot partakes 
of a zigzag path. This p ext contains all the information needed to determine < ^(7) > 
for any functional of the exterior fields in the neighborhood of S tc . It is normalized, but 
not necessarily hermitian, and it yields, in our quasiclassical approximation, exactly the 
same value (E::ans) for tr(— plogp). In addition further small variations of the definitions 
are possible. Which of these p's, if any, furnishes the appropriate description of conditions 
outside the black hole? And does at least one of them evolve autonomously, as needed for 
the general proof of entropy increase offered in [R::Raf2]? Clearly, the answers to these 
questions lie in the full quantum theory of gravity. Indeed, one can hope that difficulties of 
principle such as these (together with some of the options for resolving them) will provide 
important clues to the characteristics that the fundamental theory of quantum gravity will 
have to possess. 

The developments in this section have been based on histories that begin at Sq, pro- 
ceed into the distant future, and then double back in time, returning to Et c . In connection 
with their use, it may be interesting to ask what is the nature of the wave function that 
one obtains by evolving \l/o via* such histories, or more specifically by evolving via subsets 
for which the collapse either is or is not triggered. The initial state \l/o describes a static 
geometry with nothing going on but the metaphorical ticking of the clock of the quantum 
trigger. Evolved forward to E ic , this wave function is still the same, except that the trigger 
is closer to its moment of decision. What will we get at £t c by, for example, continuing 
the evolution forward via the histories undergoing collapse, and then evolving backward to 
E ic ? One can show (assuming unitarity) that the result will be (up to small corrections) 
the same, except that the trigger will be in the state that would guarantee collapse (for 
example the correlated EPRB photons, if they are already in flight, will have positive he- 
licity), and vice versa for the histories with no collapse. To return to such a pre-collapse 
state from a late time black hole is of course highly "anti-thermodynamic" behavior — an 
example of the "Umkehreinwand" with real practical significance. If we continue to evolve 
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backward, we get states of increasingly more bizarre anti-thermodynamic content. For 
example, if the decision mechanism utilized a photon impinging on a half silvered mirror, 
then we would get a superposition of the photon emerging from its true source, with a 
photon emerging backward from whatever it is that absorbs the photon in the case that 
the collapse is to be triggered. 

VII. Further thoughts 

We have described two "Gedankenexperimenten" in which macroscopically large fluc- 
tuations of the entropy are induced by the amplification of microscopic quantum events. 
Along with the possibility of large entropy increases in these experiments comes the pos- 
sibility of large entropy decreases, which, however, are still consistent with the second law 
of thermodynamics in the sense that, on average, they are balanced by the increases. 

Now, these large entropy decreases are not the type of "reversal of the thermodynamic 
arrow of time" that would take place, if, for example, all the air molecules in a room 
suddenly decided to migrate into one corner. Rather, they occur when a certain kind 
of "quantal superposition" of a low entropy alternative with a high entropy alternative 
resolves itself into one or the other possibility in a manner reminiscent of "state vector 
reduction" . 

The question arises, Is the entropy S that fluctuates in these examples a "Gibbs 
entropy" or a "Boltzmann entropy", does it refer to an "ensemble" or to an individual 
system? If we were think of S as the Gibbs entropy of an ensemble then its downward 
fluctuations could seem subjective and unreal (and in fact would not occur at all if we kept 
the ensemble intact). On such a view, the entropy at time t c in the black hole example 
would be, instead of (E::ans), either 2ttA/k or depending on whether the horizon was 
"really present" or not. In neither case would it fluctuate, since each element of the 
ensemble would represent an essentially classical spacetime exhibiting the normal growth 
of horizon area with time. The trouble with this point of view is that there appears to 
be no physical basis on which to distinguish the one alternative from the other — at least 
on the basis of anything existing at time t c . Rather the entropy seems to derive from the 
system's potential to evolve in two very different ways. Thus, it seems to us better to view 
(E::ans) as a kind of Boltzmann entropy, but with the peculiarity that it is not in any 



29 



evident sense a measure of the number of microscopic complexions of the corresponding 
macrostate. (At best it is an average of such complexion measures.) On this view, the 
entropy fluctuations are objective and "real" , even if the traditional formula S = log N 
fails to tell the whole story. 

In any case, both the above views share the feature that the entropy at time t c can 
be quite large even though the spacetime is flat. 

Indeed, we believe that the reflexions presented in the present work have led to a 
coherent, internally consistent picture, that imputes a well-defined and computable entropy 
to hypersurfaces like £t c . This picture bears out the conception that, if an event horizon 
is present with some probability, then a corresponding entropy must also be present. 

We believe that the existence of this kind of entropy poses a severe challenge to anyone 
attempting to base a theory of quantum gravity on the method of "canonical quantization" . 
Let's suppose for a moment that we have the correct theory of quantum gravity, and 
moreover that it is formulated in a canonical manner; that is, the theory singles out some 
choice of classical canonical variables and identifies them to corresponding objects in the 
quantum theory. (These variables could be the induced metric and extrinsic curvature 
of the ADM formalism, the Ashtekar variables, or the string degrees of freedom to name 
some of the possibilities that have been considered in a canonical context.) If, moreover, 
we have also the correct theory for non-gravitational matter, either in a form truly unified 
with gravity as in String Theory, or not, then we should be able to evaluate any desired 
quantity for a fully specified state of the system. And for any "macroscopically fully 
specified" state of the system (that is, one for which the macroscopic degrees of freedom 
are fully specified), we should, in accordance with the principle of the statistical mechanical 
origin of thermodynamics, be able to evaluate all the relevant thermodynamical quantities. 

Consider now the problem of computing, from first principles, the entropy associated 
with a given configuration of gravitational and "matter" variables. This should be possi- 
ble to do (through the introduction of suitable ensembles constructed by coarse graining 
starting from the given state) , as long as at least the macroscopic degrees of freedom are 
fully specified. Moreover the answer should agree with the known thermodynamical result 
that assigns a contribution to the entropy equal to (2tt/k) of the horizon area to those 
situations involving a black hole. Now consider the state associated with the hypersurface 
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S ic in section III. The classical metric and canonical momentum on S fc are completely 
specified, and so are the macroscopic degrees of freedom of the matter. (Note, moreover, 
that even though this could not be said to be a strictly stationary situation, the only thing 
that is changing with time is the internal clock associated with the quantum mechanical 
device. Everything else is perfectly static, and in particular, the macroscopic metric around 
Et c possesses a timelike Killing Field.) Thus, the theory should assign to this macroscopic 
configuration a pure or mixed state of the underlying quantum mechanical variables, and 
therefore the entropy should be fully determined. However, as we have seen, the situation 
on E tc is such that the entropy lies in between two very different values, depending on 
a small detail concerning an energetically insignificant degree of freedom of the matter 
fields (related to the quantum mechanical phases in the triggering device). It follows that 
the purported theory of quantum gravity must be extremely sensitive to such things as the 
phases of the matter degrees of freedom if it is to successfully reproduce the result obtained 
in section IV. It is difficult to envision a canonical theory developing in such a way as to 
incorporate this very strange feature which seems necessary if the correct result is to be 
obtained from it. 

The following objections can be (and have been) advanced to counter our arguments: 
(i) Entropy should be defined only for stationary (equilibrium) states, (ii) The black hole 
entropy could be associated with the area of the apparent horizon rather than the event 
horizon, and the former is determined from data on E. (iii) It is unreasonable to expect 
the theory of quantum gravity to solve also the "quantum measurement problem" , which 
evidently is a central feature of the situation described in section III. We feel all of these 
criticisms are unwarranted, as we now explain. 

(i) This could be argued for the thermo dynamical entropy but not for the statistical 
mechanical entropy which is what a fundamental theory should yield (through the intro- 
duction of suitable coarse grainings, etc.). Moreover the situation on Et c is for all practical 
purposes static, and it seems hard to deny that in such a case we should have a well defined 
thermodynamical entropy. Finally, by arguing that there are situations in which entropy 
is not defined, one would be calling into question the status of the second law as an argu- 
ment against perpetual motion machines. In relation to black holes in particular, one loses 
the thermodynamic interpretation of the classical area increase theorem if one limits the 
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application of the entropy concept to stationary black holes (and similarly for that portion 
of the "first law" that covers perturbations to nearby non- stationary solutions) . 

(ii) First, our best evidence for the thermodynamic properties of black holes is based 
on the event horizon, and not the apparent horizon. Of course, the two coincide for station- 
ary black holes, but in the non-stationary case, we have for the event horizon, the classical 
area increase theorem as well as that portion of the so called first law that states that 
variations in the energy are proportional to variations in the area, even for non-stationary 
perturbations [R::TdA]. In contrast, the apparent horizon jumps discontinuously in dy- 
namical spacetimes, which it is hard to believe the physical entropy would do; and, to our 
knowledge, the possibility of sudden decreases in its area has not been ruled out. Second, 
the intuitive connection between entropy and hidden information is lost for the apparent 
horizon, given that one doesn't even know in general whether it divides spacetime into 
interior and exterior regions. (In fact, due to the discontinuous jumps, the apparent hori- 
zon will not divide every hypersurface of a general foliation.) Third, the apparent horizon 
seems too tightly tied to a smooth spacetime metric for the concept to survive in a fun- 
damental theory of quantum gravity. Already in simple quantum models [R::parker], it 
loses its property of being (locally) achronal (which is crucial for the autonomous evolution 
of the exterior region) and one may well doubt whether "off shell" it could escape being 
scattered in bits and pieces all over spacetime. In contrast, the notion of a black hole 
as the region causally cut off from infinity generalizes even to discrete settings, and by 
definition, the region outside is causally independent of the region inside, at least kine- 
matically. Finally, the apparent horizon itself is not local in time (unless with respect to 
some distinguished foliation of the spacetime manifold), and in this respect is no better off 
than the event horizon. Indeed, the Schwarzschild black hole spacetime has been shown 
to possess Cauchy hypersurfaces that contain no trapped surfaces [R::Vivek] and therefore 
contain no apparent horizon that could be recognized from canonical initial data. ' This 
looks like an unsurmountable obstacle to the idea that the black hole entropy should be 
associated with the area of the apparent horizon in preference to the event horizon. 



' We don't know whether adding information from the past (but not the future) would 
suffice. If not, then the apparent horizon would actually be worse off than the event 
horizon, for which, at least, knowledge of the future always suffices! 
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(iii) The significance of the proposed example is precisely that it strongly advances the 
opposite point of view - - at least if the words "measurement problem" are construed in 
a sufficiently general sense. The fact that issues related to the interpretation of quantum 
mechanics creep into the evaluation of a physical quantity associated with times earlier than 
those at which the pertinent "measurement" is to occur (something that to our knowledge 
does not occur in nongravitational physics; cf. however [R::osgood]) is, in our opinion, a 
further indication that some change in the formal structure of quantum mechanics will be 
a necessary step on the journey to a satisfactory theory of quantum gravity. * We expect in 
particular, that one consequence of this change will be the emergence of an interpretation 
of the existing formalism not tied to the idea of measurement on a hypersurface (or any 
other pre-specified spacetime region). 

The preceding arguments are of course very far from a proof that canonical theories 
are wrong, but they seem at least to pose a severe challenge to any such theory that is 
presented as the correct theory of quantum gravity and which in particular is claimed to 
successfully evaluate the entropy of a black hole. In this respect, it is worth noting the 
relative ease with which (modulo the subtleties discussed above) the path integral yields 
the intuitively correct result for the situation in question, of course under the assumption 
that it yields the standard result (27t/k)A in standard situations. 
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